CONDUITS FOR MULTIPLE DATA SOURCES 



1 Technical Field 

2 The technical field relates to database management, and, in particular, to multiple 

3 data sources management. 

4 Background 

5 Computers are powerful tools for storing and providing access to vast amounts of 

6 information. Computer databases are a common mechanism for storing information on 

7 computer systems while providing easy access to users. A typical database is an 

8 organized collection of related information stored as "records" having "fields" of 

9 information. Between the actual physical databases itself and the users of the system, a 

1 0 database management system is typically provided as a software cushion or layer. In 

1 1 essence, the database management system shields the database user from knowing or 

12 even caring about underlying hardware-level details. All requests from users for access to 

1 3 the data are typically processed by the database management system. In other words, the 

14 database management system provides users with a conceptual view of the database that 

15 is removed from the hardware level. 

16 Applications may be built on top of application programming interfaces (API) that 

17 present information in the form of tables using a relational database model. Additionally, 

18 applications may save data retrieved from an API into a local file or database. However, 

19 an application typically does not save data retrieved from a remote database into a copy 

20 on a local database. 

21 The relational database model used by many database management systems can 

22 manage data from a single source. Faced with multiple data sources, (for example, 

23 multiple connections to the same databases or servers via the API, or live connections to 

24 databases or servers in conjunction with open copies of previously saved data,) a user 

25 typically must launch a completely separate user interface to manage each data source, 

26 because the duplicate data from the various sources described above can lead to 

27 duplication of database keys used to access and manage the data. This is a violation of 

28 the relational database model that cannot be allowed. In order to work with data from 

29 multiple data sources, the user must be able to navigate within the user interface (to 

30 manage data within a data source) and also navigate between interfaces (to manage data 

3 1 across multiple data sources). Therefore, launching separate user interfaces increases the 

32 footprint of the user interface, both in terms of memory used and CPU cycles consumed, 
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1 as shown in Table 1 , while at the same time decreasing the ability of the user to 

2 understand and manage data. 

3 Table 1 



Instance "Footprint" 
contains 


Impact CPU utilization 


Impact memory utilization 


A Java virtual machine 


Yes 


Yes 


Copies of SGM classes 
(written in Java language) 


No 


Yes 


Data specific to clusters, 
nodes, packages being 
managed 


No 


Yes 


Misc. overhead 


Yes 


Yes 



4 

5 Referring to Table 1 , data specific to clusters, nodes, and packages being managed 

6 is the only part of this instance footprint that adds value to a user. This data may be 

7 shown to the user as a set of tables presented by the service guard manager (SGM) API in 

8 the relational database model. The replication of the other parts of the instance footprint, 

9 such as the Java virtual machine (VM), the SGM classes, and overhead, consume 

1 0 resources without providing additional value to the user. 

1 1 Summary 

12 A method for managing data from multiple data sources using conduits includes 

13 maintaining database tables in individual data contexts, ensuring name spaces are unique 

14 within each data context, and combining the database tables into larger tables in a display 

1 5 context. The larger tables may contain data from multiple data sources. A user interface 

1 6 can manage the data from multiple data sources through the conduits. In an embodiment, 

1 7 data changes may be propagated transparently and bi-directionally through the conduits to 

1 8 the data contexts and the associated data sources. 

1 9 The conduits may enable one user interface to manage data from multiple data 

20 sources by allowing multiple data sources and/or different versions of a data source to be 

21 viewed and managed through a single instance of a user interface. As a result, menus, 

22 toolbars and navigational aids may operate across all the data. In addition, the conduits 

23 may shield the user interface from updating each data source individually. 
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1 Description of the Drawings 

2 The preferred embodiments of a method and apparatus for managing data from 

3 multiple data sources using conduits will be described in detail with reference to the 

4 following figures, in which like numerals refer to like elements, and wherein: 

5 Figure 1 illustrates an exemplary method for managing data from multiple data 

6 sources using conduits; 

7 Figure 2 is a flow chart illustrating the exemplary method of Figure 1 for reading 

8 data from multiple data sources using conduits; 

9 Figure 3 is a flow chart illustrating the exemplary method of Figure 1 for writing 

1 0 data to multiple data sources using conduits; and 

1 1 Figure 4 illustrates exemplary hardware components of a computer that may be 

1 2 used in connection with the method for managing data from multiple data sources using 

13 conduits. 

14 Detailed Description 

1 5 A method and apparatus for multiple data source management provides a layer of 

16 abstraction, i.e., conduits, between a data model and the presentation of the data to a user. 

17 The method applies to service guard manager (SGM) module, but can be equally applied 

18 to other database management tools. 

1 9 A SGM module provides a visual tool to manage entities, such as service guard, 

20 service guard oracle parallel server (OPS) edition, metro cluster, continental clusters, and 

21 to maintain high availability (HA). Using the SGM module, operators see color-coded, 

22 graphically-intuitive icons to get the big-picture view of multiple clusters so that they can 

23 proactively manage the clusters, systems (or nodes), and applications. The SGM module 

24 enables operators to quickly identify problems and dependencies with drill-down screens 

25 for more than one HA cluster, and enables operators to quickly know service guard status, 

26 thus minimizing operator training requirements. System administrators can validate the 

27 current service guard cluster, node, and package configuration through visualization. The 

28 following describes the conduits for multiple data sources in connection with the SGM 

29 module. However, one skilled in the art will appreciate that the conduits can be equally 

30 applied to other modules or entities having the same or similar functions. 

3 1 When a data source is open, the data source typically refers to each fundamental 

32 element, such as a cluster, a node, or a package, by a unique identifier. In other words, all 

33 key fields for the database may be unique in individual data context. However, if data 

34 from the individual data context are blended together, or if two data sources are open at 
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1 the same time, the identifiers of the saved file may be the same as the identifiers in the 

2 live connection to the clusters, nodes, and packages. In other words, primary keys of 

3 database tables, i.e., name spaces, are no longer unique, violating the relational database 

4 model. 

5 The method for managing data from multiple data sources using conduits involves 

6 maintaining database tables for each data source in individual data context, and merging 

7 the database tables into larger tables in a display context. The method ensures key values 

8 within the display data context are unique by appending a source identifier as a key field 

9 to the data before combining or updating the database tables in the display context. 

10 Thereafter, the SGM module may effectively interact with the display context, and all the 

1 1 key values may remain as unique. 

12 The conduits may enable one user interface to manage data from multiple data 

1 3 sources by allowing multiple data sources and/or different versions of a data source (for 

1 4 example, previously saved copies of the data) to be viewed and managed through a single 

1 5 instance of a user interface. In addition, the conduits may shield the user interface from 

1 6 the update performance of each data source. This functionality may allow for separate 

1 7 connection management from the display of information, and creation of parallel 

1 8 connections to back-end data to increase performance. Back-end data is typically data 

1 9 retrieved from a server, such as a cluster object manager (COM), via an API. For the 

20 SGM module specifically, shielding the user interface from the update performance of 

2 1 each data source allows for management of continental clusters, where the data must 

22 come from at least two separate data sources. 

23 Figure 1 illustrates an exemplary method for managing data from multiple data 

24 sources using conduits. The exemplary method uses conduits 100 to combine data from 

25 multiple data sources 140 to be displayed on a single display. 

26 A graphical user interface (GUI) instance 1 1 0 typically displays data that the GUI 

27 instance 1 1 0 observes in a context. The method for managing data from multiple data 

28 sources splits this context into a display context 120 and multiple data contexts 150, one 

29 per each data source 140. The display context 120 may act as shared data model for use 

30 by multiple GUI elements. Separate data contexts 150(a), 150(b), 150(c) is typically 

3 1 required because name spaces may not be unique at the data source level. 

32 Every GUI instance 1 10 may interact with a display context 120, which drives the 

33 GUI instance 1 10 via tables that the GUI instance 1 1 0 contains. The tables are internal 

34 data structures from which data is retrieved to draw the GUI elements. The GUI instance 
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1 1 1 0 is typically a running SGM program that runs in a window frame. The frame may 

2 contain GUI elements, such as a title bar, buttons, and a border. Additional GUI elements 

3 may be added, such as a menu bar, a tool bar, a split pane, a tree, a map, and a status bar. 

4 Each of the GUI elements may interact with the tables as appropriate, and may create 

5 appropriate views on the tables in the display context 120. 

6 The GUI instance 1 1 0 may have a tree display 130, which may contain one or 

7 more data sources 140(a), 140(b), 140(c). The data sources 140(a), 140(b), 140(c) 

8 typically contain clusters and unused nodes. Clusters are typically one or more nodes that 

9 are configured to run one or more packages. A data source 140 may be either an open file 

10 or a logical connection to an object manager 1 70, such as a cluster object manager 

1 1 (COM). The display for each data source 140 may contain a sub-tree data context 1 50 

12 that is equivalent to the tree display 130. Each data source 140 in the tree display 130 

1 3 may correspond to one data context 1 50. When a user connects to the object manager 

14 1 70, a list of clusters and unused nodes may be shown on the tree display 130 under a tree 

15 element associated with the data source 140. The data from all the data contexts 150(a), 

16 150(b), 150(c) associated with the data sources 140(a), 140(b), 140(c) may be combined 

17 by the conduits 160 into one display context 120 to create the tree display 130. 

1 8 The schema for a number of tables may define two identification (ID) fields, such 

19 as Id and Id2, or Objectldl and ObjectId2. Before data can be loaded into the main 

20 display context 120, the conduit 100 typically appends a source identifier as a key field to 

21 the data to uniquely define the data source 140. In other words, the conduit 100 uniquely 

22 identifies, as different data sources 140(a), 140(b), 140(c), the two instances of the same 

23 open file, or the two connections with the same cluster in scope. For example, a node 

24 may appear on the tree display 1 30 in two places, both of which may have the same 

25 "name". However, the conduit 100 may append two different source identifiers to the 

26 nodes, so that the necessary uniqueness of name spaces may be restored before the nodes 

27 are merged into the display context 120. 

28 The conduit 100 typically includes a collector 160 and a combiner 165. The 

29 collector 1 60 may retrieve data from a data source 1 40 and input the data into a data 

30 context 150, whereas the combiner 165 may merge all data in the display context 120. 

3 1 Every data context 1 50 for a logical connection to the object manager 170 may be 

32 connected to one or more collectors 160. Each collector 1 60 may have a physical 

33 connection to the object manager 170. Each collector 160 may also be responsible for 

34 managing the physical connection, i.e. error reporting and reconnection. In addition, each 
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1 collector 160 may create views to tables from the object manager 170. When data change 

2 events occur, the collector 160 may update the appropriate tables in the data context 150. 

3 When open data sources 140 are connected to by a user interface, the conduit 100 

4 may move data from the data context 1 5 0 and associated data sources 1 40 to the display 

5 context 120. The conduit 1 00 may join the identification of the data sources 140 with the 

6 table information from the data context 1 50, and update the tables in the display context 

7 120. The GUI elements that depend on views against the tables in the display context 120 

8 may be updated automatically. Other GUI elements may need to be updated explicitly 

9 after detecting changes. 

1 0 Figure 2 is a flow chart illustrating the exemplary method for reading data from 

1 1 multiple data sources using conduits 100. First, database tables may be maintained in 

12 individual data contexts 1 50, step 210. The database tables contain data from multiple 

1 3 data sources 140. Next, the conduits 1 00 may be created to ensure name spaces of the 

14 data are unique within the data contexts 1 50, step 220, by, for example, appending a 

15 source identifier as a key field to the data. Next, the database tables may be combined 

16 into one larger table in a display context 120, step 240. Data from multiple data sources 

17 may be displayed in the display context 120, so that a user interface can display the data 

1 8 from multiple data sources through the conduits 1 00, step 250. Additionally, the database 

19 tables may be updated automatically or explicitly in the display context 120, step 260, 

20 thereby shielding the user interface from updating each data source individually. 

2 1 Figure 3 is a flow chart illustrating the exemplary method for writing data to 

22 multiple data sources using conduits 100. First, after data is added to the display context 

23 120, the conduit 1 00 may request notification of any data changes, step 3 1 0. The user 

24 may later modify the data through the GUI 1 10, step 320. After the conduit 100 is 

25 notified of a data change, the modified data may be delivered to the conduit 1 00, step 

26 330. The data typically contains the unique source identifier appended to the data in step 

27 230. Next, the conduit 100 typically strips the unique source identifier from the data, so 

28 that the resulting data may be in the same original format. The conduit 100 may then 

29 update the data in the data context 1 50, step 350. The conduit 1 00 may also send this 
3 0 changed data back to the obj ect manager 1 70 for propagation back the original source 

31 entities, step 360. 

32 Figure 4 illustrates exemplary hardware components of a computer 400 that may 

33 be used in connection with the method for managing data from multiple data sources 

34 using conduits. The computer 400 includes a connection with a network 4 1 8 such as the 
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1 Internet or other type of computer or telephone networks. The computer 400 typically 

2 includes a memory 402, a secondary storage device 412, a processor 414, an input device 

3 41 6, a display device 410, and an output device 408. 

4 The memory 402 may include random access memory (RAM) or similar types of 

5 memory. The memory 402 may be connected to the network 41 8 by a web browser 406. 

6 The web browser 406 makes a connection by way of the World Wide Web (WWW) to 

7 other computers, and receives information from the other computers that is displayed on 

8 the computer 400. Information displayed on the computer 400 is typically organized into 

9 pages that are constructed using specialized language, such as HTML or XML. The 

10 secondary storage device 412 may include a hard disk drive, floppy disk drive, CD-ROM 

1 1 drive, or other types of non-volatile data storage, and may correspond with various 

12 databases or other resources. The processor 414 may execute information stored in the 

1 3 memory 402, the secondary storage 4 1 2, or received from the Internet or other network 

14 418. The input device 416 may include any device for entering data into the computer 

15 400, such as a keyboard, keypad, cursor-control device, touch-screen (possibly with a 

16 stylus), or microphone. The display device 410 may include any type of device for 

17 presenting visual image, such as, for example, a computer monitor, flat-screen display, or 

1 8 display panel. The output device 408 may include any type of device for presenting data 

19 in hard copy format, such as a printer, and other types of output devices including 

20 speakers or any device for providing data in audio form. The computer 400 can possibly 

21 include multiple input devices, output devices, and display devices. 

22 Although the computer 400 is depicted with various components, one skilled in 

23 the art will appreciate that the computer 400 can contain additional or different 

24 components. In addition, although aspects of an implementation consistent with the 

25 present invention are described as being stored in memory, one skilled in the art will 

26 appreciate that these aspects can also be stored on or read from other types of computer 

27 program products or computer-readable media, such as secondary storage devices, 

28 including hard disks, floppy disks, or CD-ROM; a carrier wave from the Internet or other 

29 network; or other forms of RAM or ROM. The computer-readable media may include 

30 instructions for controlling the computer 400 to perform a particular method. 

3 1 While the method and apparatus for managing data from multiple data sources 

32 using conduits have been described in connection with an exemplary embodiment, those 

33 skilled in the art will understand that many modifications in light of these teachings are 

34 possible, and this application is intended to cover any variations thereof. 
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